Early Address Prediction
نویسندگان
چکیده
Achieving low load-to-use latency with energy and storage overheads is critical for performance. Existing techniques either prefetch into the pipeline (via address prediction validation) or provide data reuse in register sharing L0 caches). These a range of tradeoffs between latency, reuse, overhead. In this work, we present prefetching technique that achieves state-of-the-art performance without additional storage, movement, validation by adding tags to file. Our addition file allows us forward (reuse) load from no keep alive beyond instruction’s lifetime increase temporal coalesce requests achieve spatial reuse. Further, show can use existing memory order violation detection hardware validate prefetches forwards design while also forwarding 32% loads (compared 15% sharing), delivering 16% reduction L1 dynamic (1.6% total processor energy), an area overhead less than 0.5%.
منابع مشابه
Two-Level Address Storage and Address Prediction
The amount of information recorded in the prediction tables of the proposed address predictors turns out to be comparable to the current on-chip cache sizes; for instance, a last-address predictor that records 4.096 64-bit effective addresses uses a 32 Kbytes storage space. Area cost of address predictors is proportional to address width. To reduce their area cost, we will take advantage of th...
متن کاملMemory Address Prediction for Data Speculation
Data speculation refers to the execution of an instruction before some logically preceding instructions on which it is data dependent. Data speculation implies some form of prediction of the data required by the speculative executed instruction and a recovering mechanism in case of misspeculation. This paper shows that load/store instructions are very good candidates for speculative execution s...
متن کاملReturn-Address Prediction in Speculative Multithreaded Environments
There is a growing interest in the use of speculative multithreading to speed up the execution of sequential programs. In this execution model, threads are extracted from sequential code and are speculatively executed in parallel. This makes it possible to use parallel processing to speed up ordinary applications, which are typically written as sequential programs. This paper has two objectives...
متن کاملEarly Load Address Resolution Through Register Tracking
Higher microprocessor frequencies accentuate the performance cost of memory accesses. This paper presents novel, non-speculative techniques that partially hide the increasing load-to-use latency, by allowing the early issue of load instructions. Early load address resolution relies on register tracking to safely compute the address of memory references in the frontend part of the processor pipe...
متن کاملLow power address bus encoding using loop prediction
This paper proposes a loop prediction encoding method for decreasing power consumption on instruction memory address bus. The loop prediction encoding is based on detecting and predicting loop programs. The experiment results show that our method can decrease switching activity up to 81.5% on average, with small overheads on performance and area.
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: ACM Transactions on Architecture and Code Optimization
سال: 2021
ISSN: ['1544-3973', '1544-3566']
DOI: https://doi.org/10.1145/3458883